A discourse coding scheme for conversational Spanish

Authors

  • Lori S. Levin
  • Ann E. Thymé-Gobbel
  • Alon Lavie
  • Klaus Ries
  • Klaus Zechner
Abstract

This paper describes a three-level discourse coding scheme that we devised for manually tagging the CallHome Spanish (CHS) and CallFriend Spanish (CFS) databases used in the CLARITY project. The goal of CLARITY is to explore the use of discourse structure in understanding conversational speech. The project combines empirical methods for dialogue processing with state-of-the-art large-vocabulary continuous speech recognition (LVCSR), using the JANUS recognizer. The three levels of the coding scheme are (1) a speech act level, consisting of a tag set extended from DAMSL and Switchboard; (2) a dialogue game level, defined by initiative and intention; and (3) an activity level, defined within topic units. The manually tagged dialogues are used to train automatic classifiers. We present preliminary results for statement categorization and give an in-progress report on automatic speech act classification and topic boundary identification.
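The nesting of the three levels can be pictured as a small data structure. The following is only a minimal sketch, not the authors' annotation format: the class names, field names, and the tag strings ("question", "answer", "game-1", "activity-1") are hypothetical placeholders, since the abstract does not list the actual tag set. It shows speech acts grouped into dialogue games, games grouped into activities, and a flattening step that yields one labeled row per utterance, the kind of input a classifier could be trained on.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class SpeechAct:
    """Level 1: one utterance with a speech act tag (placeholder label)."""
    speaker: str
    text: str
    tag: str


@dataclass
class DialogueGame:
    """Level 2: a group of speech acts opened by one speaker's initiative."""
    label: str       # placeholder label, not from the paper
    initiator: str
    acts: List[SpeechAct] = field(default_factory=list)


@dataclass
class Activity:
    """Level 3: an activity label attached to a stretch of dialogue games."""
    label: str       # placeholder label, not from the paper
    games: List[DialogueGame] = field(default_factory=list)


def flatten(activities: List[Activity]) -> List[Tuple[str, str, str, str]]:
    """Return one (text, speech_act, game, activity) row per utterance."""
    rows = []
    for activity in activities:
        for game in activity.games:
            for act in game.acts:
                rows.append((act.text, act.tag, game.label, activity.label))
    return rows


# Invented two-turn exchange, for illustration only.
example = [
    Activity(label="activity-1", games=[
        DialogueGame(label="game-1", initiator="A", acts=[
            SpeechAct("A", "¿Cómo estás?", "question"),
            SpeechAct("B", "Bien, gracias.", "answer"),
        ]),
    ]),
]
rows = flatten(example)
```

Keeping the levels nested rather than storing three independent tag columns makes the containment explicit: every speech act belongs to exactly one game, and every game to exactly one activity.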

Similar resources

In Proceedings of ICSLP-98: A Discourse Coding Scheme for Conversational Spanish

Conversational Repairs in Persian Dramatic Discourse: Akbar Radi's Pellekân (The Steps)

The present study investigates the conversational repair phenomenon in Persian dramatic discourse and checks whether any specific type of repair is predominantly preferred over the others in Persian drama. To this end, Schegloff, Jefferson, and Sacks’s (1977) framework has been adopted and applied to Akbar ...

Emerging temporality: past tense and temporal/aspectual markers in Spanish-speaking children's intra-conversational narratives.

This study describes how young Spanish-speaking children become gradually more adept at encoding temporality using grammar and discourse skills in intra-conversational narratives. The research involved parallel case studies of two Spanish-speaking children followed longitudinally from ages two to three. Type/token frequencies of verb tense, temporal/aspectual markers and narrative components we...

Comedy, Context and Unsaid Meaning: A Case Study in Conversational Implicature

Pragmatics moves away from the word level and sentence level study of language towards the study of language in real-world context and at discourse level whereby two or more participants take part in conversation. There are moments when the speaker explicitly says something but the listener may have other interpretations and inferences from their statements. The aim of this study was to demonst...

Pragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic- Spanish-English)

Discourse structure and coherence relations are among the main inferential challenges addressed by computational pragmatics. The present study focuses on discourse markers as key elements in guiding the inferences of the statements in natural language. Through a rule-based approach for the automatic identification, classification and annotation of the discourse markers in a multilingual parall...

Publication date: 1998